AITopics | similarity measure

Value-Aware Product Recommendation by Customer Segmentation using a suitable High-Dimensional Similarity Measure

Acosta, María Florencia, Arancibia, Rodrigo García, Llop, Pamela, Lovatto, Mariel, Mansilla, Lucas

arXiv.org Machine LearningMay-1-2026

This paper presents a novel value-aware approach to product recommendation that simultaneously addresses the high dimensionality and sparsity of user-item data while explicitly incorporating the contribution of each product and user to overall sales revenue. The proposed framework encodes revenue contributions in the user-item matrix and computes customer similarity directly on this basis using suitable distance measures. This enables the segmentation of users according to the revenue-based similarity of their purchase baskets and supports recommendations aligned with profitability objectives. We compare conventional similarity metrics with a novel alternative tailored to high-dimensional contexts and propose three recommendation strategies based on revenue share, product popularity, and expected profit generation. The effectiveness of the proposed method is validated through simulation experiments and a real-world application using the UCI Online Retail dataset.

data mining, machine learning, similarity measure, (15 more...)

arXiv.org Machine Learning

2604.26983

Country: Europe > United Kingdom (0.28)

Genre: Research Report > New Finding (0.46)

Industry:

Retail > Online (0.48)
Information Technology > Services (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.97)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.68)

Add feedback

Active clustering for labeling training data

Neural Information Processing SystemsApr-25-2026, 16:55:48 GMT

We also algorithm family, propose as a conjecture that they reach the minimum average items and analyze their complexity. In the second model, we analyze a specific the algorithms that minimize the average number of queries required to cluster the independently following a fixed distribution. In the first model, we characterize they form is drawn uniformly, the other one where each item chooses its class items, we consider two random models for the classes: one where the set partition classes (which can be labeled cheaply at the very end of the process). Given the cheap task of answering pairwise queries, and the computer groups the items into for training data gathering where the human experts perform the comparatively to see whether they belong to the same class. Thus motivated, we propose a setting determining the correct labels is much more expensive than comparing two items most practical cases rely on humans-in-the-loop to label the data. The process of has a high impact on the performance of the learned function.

algorithm, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country: Europe > United Kingdom > England (0.28)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.46)

Add feedback

080be5eb7e887319ff30c792c2cbc28c-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 10:30:37 GMT

artificial intelligence, machine learning, matrix, (18 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

072fd0525592b43da661e254bbaadc27-Supplemental-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 09:49:23 GMT

artificial intelligence, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Add feedback

072fd0525592b43da661e254bbaadc27-Paper-Conference.pdf

Neural Information Processing SystemsApr-24-2026, 09:49:19 GMT

artificial intelligence, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

80b618ebcac7aa97a6dac2ba65cb7e36-Supplemental.pdf

Neural Information Processing SystemsFeb-19-2026, 03:53:23 GMT

batch, fairness, violation, (17 more...)

Neural Information Processing Systems

Country:

Europe > Sweden > Stockholm > Stockholm (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > Canada > Quebec > Montreal (0.04)
(6 more...)

Genre: Research Report (0.66)

Industry: Education > Educational Setting > Online (0.72)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Enterprise Applications > Human Resources > Learning Management (0.71)
Information Technology > Data Science > Data Mining (0.68)

Add feedback

Supplement to " Estimating Riemannian Metric with Noise-Contaminated Intrinsic Distance "

Neural Information Processing SystemsFeb-17-2026, 18:22:43 GMT

Unlike distance metric learning where the subsequent tasks utilizing the estimated distance metric is the usual focus, the proposal focuses on the estimated metric characterizing the geometry structure. Despite the illustrated taxi and MNIST examples, it is still open to finding more compelling applications that target the data space geometry. Interpreting mathematical concepts such as Riemannian metric and geodesic in the context of potential application (e.g., cognition and perception research where similarity measures are common) could be inspiring. Our proposal requires sufficiently dense data, which could be demanding, especially for high-dimensional data due to the curse of dimensionality. Dimensional reduction (e.g., manifold embedding as in the MNIST example) can substantially alleviate the curse of dimensionality, and the dense data requirement will more likely hold true.

artificial intelligence, machine learning, tensor, (19 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.14)
North America > United States > New York > Richmond County > New York City (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning in High Dimensional Spaces (0.54)

Add feedback

Estimating Riemannian Metric with Noise-Contaminated Intrinsic Distance

Neural Information Processing SystemsFeb-17-2026, 18:22:40 GMT

The key quantity of interest here is the Riemannian metric, which characterizes the Riemannian geometry and defines straight lines and derivatives on the manifold.

artificial intelligence, machine learning, manifold, (16 more...)

Neural Information Processing Systems

Country: